Overview

Dataset statistics

Number of variables42
Number of observations186523
Missing cells6147
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory59.8 MiB
Average record size in memory336.0 B

Variable types

NUM29
BOOL7
CAT6

Warnings

flag_telefone has constant value "186523" Constant
nascimento has a high cardinality: 15116 distinct values High cardinality
data_contrato has a high cardinality: 84 distinct values High cardinality
tem_med_emp has a high cardinality: 189 distinct values High cardinality
tem_pri_emp has a high cardinality: 291 distinct values High cardinality
pri_emp_tom is highly correlated with pri_emp_sanHigh correlation
pri_emp_san is highly correlated with pri_emp_tomHigh correlation
sec_emp_san is highly correlated with sec_emp_abt and 1 other fieldsHigh correlation
sec_emp_abt is highly correlated with sec_emp_san and 1 other fieldsHigh correlation
sec_emp_tom is highly correlated with sec_emp_abt and 1 other fieldsHigh correlation
emprego has 6147 (3.3%) missing values Missing
pri_emp_abt is highly skewed (γ1 = 30.69564031) Skewed
pri_emp_san is highly skewed (γ1 = 306.8631852) Skewed
pri_emp_tom is highly skewed (γ1 = 305.7799348) Skewed
sec_qtd_tot_emp is highly skewed (γ1 = 28.32622344) Skewed
sec_qtd_tot_emp_atv is highly skewed (γ1 = 31.15570999) Skewed
sec_qtd_tot_def is highly skewed (γ1 = 24.27761841) Skewed
sec_emp_abt is highly skewed (γ1 = 104.3726495) Skewed
sec_emp_san is highly skewed (γ1 = 73.22473392) Skewed
sec_emp_tom is highly skewed (γ1 = 73.71455172) Skewed
par_pri_emp is highly skewed (γ1 = 70.11203895) Skewed
par_seg_emp is highly skewed (γ1 = 153.6010717) Skewed
df_index has unique values Unique
id_pessoa has unique values Unique
score has 93698 (50.2%) zeros Zeros
pri_qtd_tot_emp has 93698 (50.2%) zeros Zeros
pri_qtd_tot_emp_atv has 109771 (58.9%) zeros Zeros
pri_qtd_tot_def has 165616 (88.8%) zeros Zeros
pri_emp_abt has 113463 (60.8%) zeros Zeros
pri_emp_san has 110630 (59.3%) zeros Zeros
pri_emp_tom has 110717 (59.4%) zeros Zeros
sec_qtd_tot_emp has 181821 (97.5%) zeros Zeros
sec_qtd_tot_emp_atv has 183450 (98.4%) zeros Zeros
sec_qtd_tot_def has 185430 (99.4%) zeros Zeros
sec_emp_abt has 183819 (98.6%) zeros Zeros
sec_emp_san has 183518 (98.4%) zeros Zeros
sec_emp_tom has 183542 (98.4%) zeros Zeros
par_pri_emp has 127636 (68.4%) zeros Zeros
par_seg_emp has 184739 (99.0%) zeros Zeros
nov_emp_6m has 145294 (77.9%) zeros Zeros
def_emp_6m has 172034 (92.2%) zeros Zeros
qtd_sol_emp has 161559 (86.6%) zeros Zeros

Reproduction

Analysis started2020-10-02 18:39:36.128904
Analysis finished2020-10-02 18:43:42.410166
Duration4 minutes and 6.28 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct186523
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean116450.0138
Minimum0
Maximum233153
Zeros1
Zeros (%)< 0.1%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile11628.1
Q158097.5
median116400
Q3174698.5
95-th percentile221434.9
Maximum233153
Range233153
Interquartile range (IQR)116601

Descriptive statistics

Standard deviation67312.08146
Coefficient of variation (CV)0.5780341219
Kurtosis-1.200723562
Mean116450.0138
Median Absolute Deviation (MAD)58301
Skewness0.001925345223
Sum2.172060592e+10
Variance4530916311
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
20471< 0.1%
 
894731< 0.1%
 
1754871< 0.1%
 
1734381< 0.1%
 
1795811< 0.1%
 
1775321< 0.1%
 
1672911< 0.1%
 
1713851< 0.1%
 
1693361< 0.1%
 
1959571< 0.1%
 
Other values (186513)186513> 99.9%
 
ValueCountFrequency (%) 
01< 0.1%
 
11< 0.1%
 
21< 0.1%
 
31< 0.1%
 
41< 0.1%
 
ValueCountFrequency (%) 
2331531< 0.1%
 
2331521< 0.1%
 
2331511< 0.1%
 
2331501< 0.1%
 
2331491< 0.1%
 

id_pessoa
Real number (ℝ≥0)

UNIQUE

Distinct186523
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean535857.1509
Minimum417428
Maximum671084
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum417428
5-th percentile429177.1
Q1476752.5
median535850
Q3594974.5
95-th percentile642340.8
Maximum671084
Range253656
Interquartile range (IQR)118222

Descriptive statistics

Standard deviation68332.56085
Coefficient of variation (CV)0.1275201063
Kurtosis-1.197307771
Mean535857.1509
Median Absolute Deviation (MAD)59112
Skewness-0.002015233297
Sum9.994968336e+10
Variance4669338872
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
5263351< 0.1%
 
4540071< 0.1%
 
4458111< 0.1%
 
4478561< 0.1%
 
4212231< 0.1%
 
4191741< 0.1%
 
4232681< 0.1%
 
5031351< 0.1%
 
5010861< 0.1%
 
5072291< 0.1%
 
Other values (186513)186513> 99.9%
 
ValueCountFrequency (%) 
4174281< 0.1%
 
4174291< 0.1%
 
4174301< 0.1%
 
4174311< 0.1%
 
4174321< 0.1%
 
ValueCountFrequency (%) 
6710841< 0.1%
 
6710331< 0.1%
 
6586761< 0.1%
 
6586751< 0.1%
 
6586741< 0.1%
 

valor_emprestimo
Real number (ℝ≥0)

Distinct21749
Distinct (%)11.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54353.68279
Minimum13320
Maximum990572
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum13320
5-th percentile34939
Q147145
median53803
Q360401
95-th percentile74079
Maximum990572
Range977252
Interquartile range (IQR)13256

Descriptive statistics

Standard deviation13025.57436
Coefficient of variation (CV)0.2396447433
Kurtosis304.6867808
Mean54353.68279
Median Absolute Deviation (MAD)6644
Skewness5.254190427
Sum1.013821198e+10
Variance169665587.4
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
4834917260.9%
 
5330316950.9%
 
5130315780.8%
 
5030315690.8%
 
5525915070.8%
 
5230314850.8%
 
4734914620.8%
 
5725912970.7%
 
5625912840.7%
 
4634912770.7%
 
Other values (21739)17164392.0%
 
ValueCountFrequency (%) 
133201< 0.1%
 
133691< 0.1%
 
136001< 0.1%
 
136401< 0.1%
 
136644< 0.1%
 
ValueCountFrequency (%) 
9905721< 0.1%
 
9873541< 0.1%
 
5924601< 0.1%
 
3320451< 0.1%
 
2377791< 0.1%
 

custo_ativo
Real number (ℝ≥0)

Distinct42481
Distinct (%)22.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean75854.33434
Minimum37000
Maximum1628992
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum37000
5-th percentile58251
Q165717.5
median70945
Q379181
95-th percentile109635.7
Maximum1628992
Range1591992
Interquartile range (IQR)13463.5

Descriptive statistics

Standard deviation19060.17616
Coefficient of variation (CV)0.2512733955
Kurtosis352.8904128
Mean75854.33434
Median Absolute Deviation (MAD)6192
Skewness6.93547847
Sum1.414857800e+10
Variance363290315.1
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
680005480.3%
 
670004790.3%
 
720004230.2%
 
700003970.2%
 
740003770.2%
 
730003720.2%
 
660003700.2%
 
750003640.2%
 
690003510.2%
 
650003200.2%
 
Other values (42471)18252297.9%
 
ValueCountFrequency (%) 
370002< 0.1%
 
371291< 0.1%
 
372301< 0.1%
 
373101< 0.1%
 
378161< 0.1%
 
ValueCountFrequency (%) 
16289921< 0.1%
 
13289541< 0.1%
 
7151861< 0.1%
 
4596251< 0.1%
 
3836001< 0.1%
 

emprestimo_custo
Real number (ℝ≥0)

Distinct6378
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74.75372265
Minimum14.17
Maximum95
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum14.17
5-th percentile52.46
Q168.87
median76.8
Q383.67
95-th percentile89.38
Maximum95
Range80.83
Interquartile range (IQR)14.8

Descriptive statistics

Standard deviation11.4304345
Coefficient of variation (CV)0.1529078967
Kurtosis1.275395213
Mean74.75372265
Median Absolute Deviation (MAD)7.27
Skewness-1.069009619
Sum13943288.61
Variance130.6548328
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
8534931.9%
 
84.998200.4%
 
79.994480.2%
 
803940.2%
 
79.93320.2%
 
753290.2%
 
74.933120.2%
 
79.793110.2%
 
89.892740.1%
 
79.952690.1%
 
Other values (6368)17954196.3%
 
ValueCountFrequency (%) 
14.171< 0.1%
 
15.31< 0.1%
 
15.581< 0.1%
 
17.051< 0.1%
 
17.131< 0.1%
 
ValueCountFrequency (%) 
956< 0.1%
 
94.994< 0.1%
 
94.988< 0.1%
 
94.975< 0.1%
 
94.9611< 0.1%
 

agencia
Real number (ℝ≥0)

Distinct82
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72.81785624
Minimum1
Maximum261
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum1
5-th percentile2
Q114
median61
Q3130
95-th percentile249
Maximum261
Range260
Interquartile range (IQR)116

Descriptive statistics

Standard deviation69.80282204
Coefficient of variation (CV)0.9585948507
Kurtosis0.3058193865
Mean72.81785624
Median Absolute Deviation (MAD)50
Skewness1.030977532
Sum13582205
Variance4872.433964
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
2104815.6%
 
6791164.9%
 
374194.0%
 
573964.0%
 
3670583.8%
 
13662593.4%
 
3462243.3%
 
1651682.8%
 
1946782.5%
 
145922.5%
 
Other values (72)11813263.3%
 
ValueCountFrequency (%) 
145922.5%
 
2104815.6%
 
374194.0%
 
573964.0%
 
725651.4%
 
ValueCountFrequency (%) 
2611410.1%
 
2602880.2%
 
2592850.2%
 
2583020.2%
 
2579800.5%
 

revendedora
Real number (ℝ≥0)

Distinct2924
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19635.29279
Minimum10524
Maximum24803
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum10524
5-th percentile14180
Q116535
median20333
Q323000
95-th percentile24119
Maximum24803
Range14279
Interquartile range (IQR)6465

Descriptive statistics

Standard deviation3491.178096
Coefficient of variation (CV)0.1778011732
Kurtosis-1.47618857
Mean19635.29279
Median Absolute Deviation (MAD)3058
Skewness-0.1680836109
Sum3662433718
Variance12188324.5
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1831711280.6%
 
1569410480.6%
 
1566310260.6%
 
1798010210.5%
 
1423410010.5%
 
181669600.5%
 
143759140.5%
 
219809110.5%
 
227278730.5%
 
141458470.5%
 
Other values (2914)17679494.8%
 
ValueCountFrequency (%) 
105246< 0.1%
 
123112< 0.1%
 
1231239< 0.1%
 
1237480< 0.1%
 
1244139< 0.1%
 
ValueCountFrequency (%) 
248032< 0.1%
 
248022< 0.1%
 
247991< 0.1%
 
247971< 0.1%
 
247931< 0.1%
 

montadora
Real number (ℝ≥0)

Distinct11
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69.0078489
Minimum45
Maximum156
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum45
5-th percentile45
Q148
median86
Q386
95-th percentile86
Maximum156
Range111
Interquartile range (IQR)38

Descriptive statistics

Standard deviation22.11344458
Coefficient of variation (CV)0.3204482524
Kurtosis-0.7243704664
Mean69.0078489
Median Absolute Deviation (MAD)34
Skewness0.3859416686
Sum12871551
Variance489.0044312
MonotocityNot monotonic
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%) 
868772447.0%
 
454535424.3%
 
512175311.7%
 
48133467.2%
 
4981634.4%
 
12076374.1%
 
6719151.0%
 
1456160.3%
 
15310< 0.1%
 
1524< 0.1%
 
ValueCountFrequency (%) 
454535424.3%
 
48133467.2%
 
4981634.4%
 
512175311.7%
 
6719151.0%
 
ValueCountFrequency (%) 
1561< 0.1%
 
15310< 0.1%
 
1524< 0.1%
 
1456160.3%
 
12076374.1%
 

Current_pincode_ID
Real number (ℝ≥0)

Distinct6510
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3400.717863
Minimum1
Maximum7345
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum1
5-th percentile263
Q11511
median2972
Q35679
95-th percentile6943
Maximum7345
Range7344
Interquartile range (IQR)4168

Descriptive statistics

Standard deviation2238.301828
Coefficient of variation (CV)0.6581851004
Kurtosis-1.288888058
Mean3400.717863
Median Absolute Deviation (MAD)1920
Skewness0.2744225481
Sum634312098
Variance5009995.075
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
257815180.8%
 
144613780.7%
 
15158850.5%
 
29897460.4%
 
29437300.4%
 
15096930.4%
 
27826710.4%
 
17946410.3%
 
5716240.3%
 
33636010.3%
 
Other values (6500)17803695.4%
 
ValueCountFrequency (%) 
119< 0.1%
 
253< 0.1%
 
337< 0.1%
 
470< 0.1%
 
51780.1%
 
ValueCountFrequency (%) 
73456< 0.1%
 
73441< 0.1%
 
73431< 0.1%
 
73421< 0.1%
 
73414< 0.1%
 

nascimento
Categorical

HIGH CARDINALITY

Distinct15116
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
01-01-90
 
1735
01-01-88
 
1719
01-01-87
 
1693
01-01-86
 
1657
01-01-85
 
1620
Other values (15111)
178099 
ValueCountFrequency (%) 
01-01-9017350.9%
 
01-01-8817190.9%
 
01-01-8716930.9%
 
01-01-8616570.9%
 
01-01-8516200.9%
 
01-01-8915860.9%
 
01-01-9115800.8%
 
01-01-9515520.8%
 
01-01-9215500.8%
 
01-01-9315350.8%
 
Other values (15106)17029691.3%
 
Frequencies of value counts

Unique

Unique1666 ?
Unique (%)0.9%
Histogram of lengths of the category

Length

Max length8
Median length8
Mean length8
Min length8

emprego
Categorical

MISSING

Distinct2
Distinct (%)< 0.1%
Missing6147
Missing (%)3.3%
Memory size1.4 MiB
Self employed
102028 
Salaried
78348 
ValueCountFrequency (%) 
Self employed10202854.7%
 
Salaried7834842.0%
 
(Missing)61473.3%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length13
Median length13
Mean length10.57021922
Min length3

data_contrato
Categorical

HIGH CARDINALITY

Distinct84
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
31-10-18
 
7033
24-10-18
 
5366
31-08-18
 
5351
23-10-18
 
5182
26-10-18
 
4961
Other values (79)
158630 
ValueCountFrequency (%) 
31-10-1870333.8%
 
24-10-1853662.9%
 
31-08-1853512.9%
 
23-10-1851822.8%
 
26-10-1849612.7%
 
25-10-1847152.5%
 
30-10-1847052.5%
 
22-10-1846922.5%
 
30-08-1837792.0%
 
29-10-1834971.9%
 
Other values (74)13724273.6%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length8
Median length8
Mean length8
Min length8

estado
Real number (ℝ≥0)

Distinct22
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.259833908
Minimum1
Maximum22
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum1
5-th percentile2
Q14
median6
Q310
95-th percentile16
Maximum22
Range21
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.478373964
Coefficient of variation (CV)0.6168700304
Kurtosis-0.3275150184
Mean7.259833908
Median Absolute Deviation (MAD)3
Skewness0.8217260468
Sum1354126
Variance20.05583336
MonotocityNot monotonic
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%) 
43573619.2%
 
32736014.7%
 
62681714.4%
 
13143207.7%
 
9128806.9%
 
8113886.1%
 
581934.4%
 
1474654.0%
 
171403.8%
 
754272.9%
 
Other values (12)2979716.0%
 
ValueCountFrequency (%) 
171403.8%
 
233191.8%
 
32736014.7%
 
43573619.2%
 
581934.4%
 
ValueCountFrequency (%) 
2257< 0.1%
 
211240.1%
 
201460.1%
 
198270.4%
 
1842882.3%
 

funcionario
Real number (ℝ≥0)

Distinct3258
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1548.582807
Minimum1
Maximum3795
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum1
5-th percentile149
Q1712
median1451
Q32358
95-th percentile3185
Maximum3795
Range3794
Interquartile range (IQR)1646

Descriptive statistics

Standard deviation974.7103402
Coefficient of variation (CV)0.6294208714
Kurtosis-1.050335125
Mean1548.582807
Median Absolute Deviation (MAD)810
Skewness0.2455820427
Sum288846311
Variance950060.2473
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
25464970.3%
 
6203990.2%
 
2553940.2%
 
21533340.2%
 
1303320.2%
 
9562880.2%
 
7512870.2%
 
14942860.2%
 
642850.2%
 
9082810.2%
 
Other values (3248)18314098.2%
 
ValueCountFrequency (%) 
165< 0.1%
 
31160.1%
 
460< 0.1%
 
575< 0.1%
 
71140.1%
 
ValueCountFrequency (%) 
37951< 0.1%
 
37941< 0.1%
 
37913< 0.1%
 
37901< 0.1%
 
37892< 0.1%
 

flag_telefone
Boolean

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
1
186523 
ValueCountFrequency (%) 
1186523100.0%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
1
156735 
0
29788 
ValueCountFrequency (%) 
115673584.0%
 
02978816.0%
 

flag_pan
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0
172460 
1
 
14063
ValueCountFrequency (%) 
017246092.5%
 
1140637.5%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0
159461 
1
27062 
ValueCountFrequency (%) 
015946185.5%
 
12706214.5%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0
182234 
1
 
4289
ValueCountFrequency (%) 
018223497.7%
 
142892.3%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0
186131 
1
 
392
ValueCountFrequency (%) 
018613199.8%
 
13920.2%
 

score
Real number (ℝ≥0)

ZEROS

Distinct573
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean288.9711832
Minimum0
Maximum890
Zeros93698
Zeros (%)50.2%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q3678
95-th percentile825
Maximum890
Range890
Interquartile range (IQR)678

Descriptive statistics

Standard deviation338.3142486
Coefficient of variation (CV)1.170754277
Kurtosis-1.63277166
Mean288.9711832
Median Absolute Deviation (MAD)0
Skewness0.4482240891
Sum53899772
Variance114456.5308
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
09369850.2%
 
30070643.8%
 
73869143.7%
 
82558713.1%
 
1530001.6%
 
1729261.6%
 
76324061.3%
 
1623181.2%
 
70816690.9%
 
73716090.9%
 
Other values (563)5904831.7%
 
ValueCountFrequency (%) 
09369850.2%
 
113< 0.1%
 
147670.4%
 
1530001.6%
 
1623181.2%
 
ValueCountFrequency (%) 
8903< 0.1%
 
8841< 0.1%
 
87951< 0.1%
 
8785< 0.1%
 
8737< 0.1%
 

score_desc
Categorical

Distinct20
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
No Bureau History Available
93698 
C-Very Low Risk
12838 
A-Very Low Risk
11278 
D-Very Low Risk
 
9093
B-Very Low Risk
 
7332
Other values (15)
52284 
ValueCountFrequency (%) 
No Bureau History Available9369850.2%
 
C-Very Low Risk128386.9%
 
A-Very Low Risk112786.0%
 
D-Very Low Risk90934.9%
 
B-Very Low Risk73323.9%
 
M-Very High Risk70643.8%
 
F-Low Risk68473.7%
 
K-High Risk65663.5%
 
H-Medium Risk54492.9%
 
E-Low Risk46282.5%
 
Other values (10)2173011.7%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length55
Median length27
Mean length22.21520134
Min length10

pri_qtd_tot_emp
Real number (ℝ≥0)

ZEROS

Distinct104
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.434048348
Minimum0
Maximum453
Zeros93698
Zeros (%)50.2%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q33
95-th percentile11
Maximum453
Range453
Interquartile range (IQR)3

Descriptive statistics

Standard deviation5.24306598
Coefficient of variation (CV)2.154051699
Kurtosis495.3134747
Mean2.434048348
Median Absolute Deviation (MAD)0
Skewness10.71977469
Sum454006
Variance27.48974087
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
09369850.2%
 
12794315.0%
 
2158268.5%
 
3103625.6%
 
474554.0%
 
557953.1%
 
644662.4%
 
735461.9%
 
827941.5%
 
923001.2%
 
Other values (94)123386.6%
 
ValueCountFrequency (%) 
09369850.2%
 
12794315.0%
 
2158268.5%
 
3103625.6%
 
474554.0%
 
ValueCountFrequency (%) 
4531< 0.1%
 
3541< 0.1%
 
2711< 0.1%
 
1941< 0.1%
 
1482< 0.1%
 

pri_qtd_tot_emp_atv
Real number (ℝ≥0)

ZEROS

Distinct39
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.036268986
Minimum0
Maximum144
Zeros109771
Zeros (%)58.9%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile5
Maximum144
Range144
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.942854644
Coefficient of variation (CV)1.874855536
Kurtosis188.0254518
Mean1.036268986
Median Absolute Deviation (MAD)0
Skewness5.823731476
Sum193288
Variance3.774684168
MonotocityNot monotonic
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%) 
010977158.9%
 
13358318.0%
 
2172579.3%
 
397595.2%
 
459783.2%
 
536101.9%
 
622171.2%
 
714300.8%
 
89540.5%
 
95880.3%
 
Other values (29)13760.7%
 
ValueCountFrequency (%) 
010977158.9%
 
13358318.0%
 
2172579.3%
 
397595.2%
 
459783.2%
 
ValueCountFrequency (%) 
1441< 0.1%
 
651< 0.1%
 
521< 0.1%
 
431< 0.1%
 
421< 0.1%
 

pri_qtd_tot_def
Real number (ℝ≥0)

ZEROS

Distinct20
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.156045099
Minimum0
Maximum23
Zeros165616
Zeros (%)88.8%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum23
Range23
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.5457297021
Coefficient of variation (CV)3.497256278
Kurtosis104.0289984
Mean0.156045099
Median Absolute Deviation (MAD)0
Skewness7.068909116
Sum29106
Variance0.2978209077
MonotocityNot monotonic
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%) 
016561688.8%
 
1158518.5%
 
234371.8%
 
39690.5%
 
43220.2%
 
51440.1%
 
676< 0.1%
 
732< 0.1%
 
822< 0.1%
 
919< 0.1%
 
Other values (10)35< 0.1%
 
ValueCountFrequency (%) 
016561688.8%
 
1158518.5%
 
234371.8%
 
39690.5%
 
43220.2%
 
ValueCountFrequency (%) 
231< 0.1%
 
191< 0.1%
 
171< 0.1%
 
161< 0.1%
 
151< 0.1%
 

pri_emp_abt
Real number (ℝ)

SKEWED
ZEROS

Distinct58811
Distinct (%)31.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean166181.9059
Minimum-2018309
Maximum96524920
Zeros113463
Zeros (%)60.8%
Memory size1.4 MiB

Quantile statistics

Minimum-2018309
5-th percentile0
Q10
median0
Q334860
95-th percentile801337.7
Maximum96524920
Range98543229
Interquartile range (IQR)34860

Descriptive statistics

Standard deviation967065.0273
Coefficient of variation (CV)5.81931602
Kurtosis1709.788142
Mean166181.9059
Median Absolute Deviation (MAD)0
Skewness30.69564031
Sum3.099674764e+10
Variance9.35214767e+11
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
011346360.8%
 
8001000.1%
 
400960.1%
 
10000071< 0.1%
 
5000068< 0.1%
 
4000067< 0.1%
 
3000067< 0.1%
 
2500060< 0.1%
 
2000055< 0.1%
 
6000052< 0.1%
 
Other values (58801)7242438.8%
 
ValueCountFrequency (%) 
-20183091< 0.1%
 
-17384151< 0.1%
 
-14083141< 0.1%
 
-13064491< 0.1%
 
-11782421< 0.1%
 
ValueCountFrequency (%) 
965249201< 0.1%
 
756034001< 0.1%
 
664061601< 0.1%
 
635313201< 0.1%
 
633590401< 0.1%
 

pri_emp_san
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct36559
Distinct (%)19.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean219739.6944
Minimum0
Maximum1000000000
Zeros110630
Zeros (%)59.3%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q362000
95-th percentile1030000
Maximum1000000000
Range1000000000
Interquartile range (IQR)62000

Descriptive statistics

Standard deviation2602928.113
Coefficient of variation (CV)11.84550711
Kurtosis116769.9394
Mean219739.6944
Median Absolute Deviation (MAD)0
Skewness306.8631852
Sum4.098650702e+10
Variance6.77523476e+12
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
011063059.3%
 
5000012040.6%
 
3000011330.6%
 
1000007820.4%
 
250007520.4%
 
400007020.4%
 
200006850.4%
 
600004760.3%
 
2000004720.3%
 
150004450.2%
 
Other values (36549)6924237.1%
 
ValueCountFrequency (%) 
011063059.3%
 
129< 0.1%
 
218< 0.1%
 
312< 0.1%
 
419< 0.1%
 
ValueCountFrequency (%) 
10000000001< 0.1%
 
1058657121< 0.1%
 
1004250001< 0.1%
 
926228161< 0.1%
 
803275601< 0.1%
 

pri_emp_tom
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct39362
Distinct (%)21.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean219339.6203
Minimum0
Maximum1000000000
Zeros110717
Zeros (%)59.4%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q360067
95-th percentile1027987
Maximum1000000000
Range1000000000
Interquartile range (IQR)60067

Descriptive statistics

Standard deviation2606129.24
Coefficient of variation (CV)11.88170763
Kurtosis116198.2453
Mean219339.6203
Median Absolute Deviation (MAD)0
Skewness305.7799348
Sum4.091188399e+10
Variance6.791909615e+12
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
011071759.4%
 
5000011250.6%
 
3000010540.6%
 
1000007650.4%
 
400006380.3%
 
250005980.3%
 
200005300.3%
 
2000004790.3%
 
3000004410.2%
 
600004240.2%
 
Other values (39352)6975237.4%
 
ValueCountFrequency (%) 
011071759.4%
 
134< 0.1%
 
219< 0.1%
 
312< 0.1%
 
418< 0.1%
 
ValueCountFrequency (%) 
10000000001< 0.1%
 
1057557121< 0.1%
 
1004250001< 0.1%
 
926287281< 0.1%
 
803491681< 0.1%
 

sec_qtd_tot_emp
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct37
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0609897975
Minimum0
Maximum52
Zeros181821
Zeros (%)97.5%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum52
Range52
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.6570413928
Coefficient of variation (CV)10.7729722
Kurtosis1283.769316
Mean0.0609897975
Median Absolute Deviation (MAD)0
Skewness28.32622344
Sum11376
Variance0.4317033919
MonotocityNot monotonic
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%) 
018182197.5%
 
127451.5%
 
28300.4%
 
33620.2%
 
42380.1%
 
51190.1%
 
61000.1%
 
860< 0.1%
 
758< 0.1%
 
933< 0.1%
 
Other values (27)1570.1%
 
ValueCountFrequency (%) 
018182197.5%
 
127451.5%
 
28300.4%
 
33620.2%
 
42380.1%
 
ValueCountFrequency (%) 
521< 0.1%
 
462< 0.1%
 
421< 0.1%
 
382< 0.1%
 
371< 0.1%
 

sec_qtd_tot_emp_atv
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct22
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.02841472633
Minimum0
Maximum36
Zeros183450
Zeros (%)98.4%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum36
Range36
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3256757753
Coefficient of variation (CV)11.4615137
Kurtosis1835.099719
Mean0.02841472633
Median Absolute Deviation (MAD)0
Skewness31.15570999
Sum5300
Variance0.1060647106
MonotocityNot monotonic
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%) 
018345098.4%
 
121291.1%
 
25260.3%
 
31590.1%
 
41040.1%
 
550< 0.1%
 
627< 0.1%
 
721< 0.1%
 
815< 0.1%
 
910< 0.1%
 
Other values (12)32< 0.1%
 
ValueCountFrequency (%) 
018345098.4%
 
121291.1%
 
25260.3%
 
31590.1%
 
41040.1%
 
ValueCountFrequency (%) 
361< 0.1%
 
261< 0.1%
 
222< 0.1%
 
211< 0.1%
 
201< 0.1%
 

sec_qtd_tot_def
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct9
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.007511138037
Minimum0
Maximum8
Zeros185430
Zeros (%)99.4%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum8
Range8
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.1149469213
Coefficient of variation (CV)15.30352934
Kurtosis866.3538733
Mean0.007511138037
Median Absolute Deviation (MAD)0
Skewness24.27761841
Sum1401
Variance0.01321279471
MonotocityNot monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%) 
018543099.4%
 
19160.5%
 
21050.1%
 
338< 0.1%
 
419< 0.1%
 
58< 0.1%
 
65< 0.1%
 
81< 0.1%
 
71< 0.1%
 
ValueCountFrequency (%) 
018543099.4%
 
19160.5%
 
21050.1%
 
338< 0.1%
 
419< 0.1%
 
ValueCountFrequency (%) 
81< 0.1%
 
71< 0.1%
 
65< 0.1%
 
58< 0.1%
 
419< 0.1%
 

sec_emp_abt
Real number (ℝ)

HIGH CORRELATION
SKEWED
ZEROS

Distinct2625
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5814.831436
Minimum-574647
Maximum36032852
Zeros183819
Zeros (%)98.6%
Memory size1.4 MiB

Quantile statistics

Minimum-574647
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum36032852
Range36607499
Interquartile range (IQR)0

Descriptive statistics

Standard deviation184752.4064
Coefficient of variation (CV)31.77261601
Kurtosis15462.23426
Mean5814.831436
Median Absolute Deviation (MAD)0
Skewness104.3726495
Sum1084599804
Variance3.413345167e+10
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
018381998.6%
 
8008< 0.1%
 
4007< 0.1%
 
1006< 0.1%
 
5895< 0.1%
 
12004< 0.1%
 
10704< 0.1%
 
200003< 0.1%
 
50003< 0.1%
 
129503< 0.1%
 
Other values (2615)26611.4%
 
ValueCountFrequency (%) 
-5746471< 0.1%
 
-1555271< 0.1%
 
-1171381< 0.1%
 
-312901< 0.1%
 
-200001< 0.1%
 
ValueCountFrequency (%) 
360328521< 0.1%
 
295605401< 0.1%
 
246920241< 0.1%
 
224971721< 0.1%
 
196382801< 0.1%
 

sec_emp_san
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct1856
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7787.940903
Minimum0
Maximum30000000
Zeros183518
Zeros (%)98.4%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum30000000
Range30000000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation197362.8548
Coefficient of variation (CV)25.34210997
Kurtosis7966.347359
Mean7787.940903
Median Absolute Deviation (MAD)0
Skewness73.22473392
Sum1452630101
Variance3.895209645e+10
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
018351898.4%
 
5000065< 0.1%
 
10000051< 0.1%
 
20000033< 0.1%
 
4000032< 0.1%
 
3000031< 0.1%
 
1000026< 0.1%
 
1500026< 0.1%
 
2500026< 0.1%
 
2000025< 0.1%
 
Other values (1846)26901.4%
 
ValueCountFrequency (%) 
018351898.4%
 
13< 0.1%
 
82< 0.1%
 
91< 0.1%
 
181< 0.1%
 
ValueCountFrequency (%) 
300000001< 0.1%
 
268882001< 0.1%
 
250000001< 0.1%
 
198000001< 0.1%
 
186910021< 0.1%
 

sec_emp_tom
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct2104
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7668.266058
Minimum0
Maximum30000000
Zeros183542
Zeros (%)98.4%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum30000000
Range30000000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation196767.7237
Coefficient of variation (CV)25.66000216
Kurtosis8056.586376
Mean7668.266058
Median Absolute Deviation (MAD)0
Skewness73.71455172
Sum1430307990
Variance3.871753707e+10
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
018354298.4%
 
5000046< 0.1%
 
10000040< 0.1%
 
20000031< 0.1%
 
4000026< 0.1%
 
50000022< 0.1%
 
30000022< 0.1%
 
3000019< 0.1%
 
15000018< 0.1%
 
40000017< 0.1%
 
Other values (2094)27401.5%
 
ValueCountFrequency (%) 
018354298.4%
 
13< 0.1%
 
82< 0.1%
 
91< 0.1%
 
181< 0.1%
 
ValueCountFrequency (%) 
300000001< 0.1%
 
268882001< 0.1%
 
250000001< 0.1%
 
198000001< 0.1%
 
186910021< 0.1%
 

par_pri_emp
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct24658
Distinct (%)13.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13382.56195
Minimum0
Maximum25642806
Zeros127636
Zeros (%)68.4%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31998.5
95-th percentile26629.3
Maximum25642806
Range25642806
Interquartile range (IQR)1998.5

Descriptive statistics

Standard deviation161858.1317
Coefficient of variation (CV)12.09470446
Kurtosis7775.502538
Mean13382.56195
Median Absolute Deviation (MAD)0
Skewness70.11203895
Sum2496155603
Variance2.61980548e+10
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
012763668.4%
 
16202340.1%
 
15001200.1%
 
16001150.1%
 
20001130.1%
 
25001130.1%
 
11491080.1%
 
1250940.1%
 
170093< 0.1%
 
156580< 0.1%
 
Other values (24648)5781731.0%
 
ValueCountFrequency (%) 
012763668.4%
 
13< 0.1%
 
23< 0.1%
 
317< 0.1%
 
411< 0.1%
 
ValueCountFrequency (%) 
256428061< 0.1%
 
207665531< 0.1%
 
174088221< 0.1%
 
155185461< 0.1%
 
154204111< 0.1%
 

par_seg_emp
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct1593
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean341.3471904
Minimum0
Maximum4170901
Zeros184739
Zeros (%)99.0%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum4170901
Range4170901
Interquartile range (IQR)0

Descriptive statistics

Standard deviation16471.8578
Coefficient of variation (CV)48.25543687
Kurtosis32130.06834
Mean341.3471904
Median Absolute Deviation (MAD)0
Skewness153.6010717
Sum63669102
Variance271322099.4
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
018473999.0%
 
50005< 0.1%
 
10655< 0.1%
 
8335< 0.1%
 
12325< 0.1%
 
21005< 0.1%
 
15655< 0.1%
 
24004< 0.1%
 
16504< 0.1%
 
20654< 0.1%
 
Other values (1583)17420.9%
 
ValueCountFrequency (%) 
018473999.0%
 
12< 0.1%
 
21< 0.1%
 
31< 0.1%
 
52< 0.1%
 
ValueCountFrequency (%) 
41709011< 0.1%
 
32467101< 0.1%
 
18140001< 0.1%
 
15899461< 0.1%
 
14476001< 0.1%
 

nov_emp_6m
Real number (ℝ≥0)

ZEROS

Distinct25
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3815079106
Minimum0
Maximum35
Zeros145294
Zeros (%)77.9%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum35
Range35
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.9578173187
Coefficient of variation (CV)2.510609327
Kurtosis50.46994352
Mean0.3815079106
Median Absolute Deviation (MAD)0
Skewness4.940283871
Sum71160
Variance0.917414016
MonotocityNot monotonic
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%) 
014529477.9%
 
12556013.7%
 
288494.7%
 
335661.9%
 
415570.8%
 
57570.4%
 
63740.2%
 
72460.1%
 
81260.1%
 
966< 0.1%
 
Other values (15)1280.1%
 
ValueCountFrequency (%) 
014529477.9%
 
12556013.7%
 
288494.7%
 
335661.9%
 
415570.8%
 
ValueCountFrequency (%) 
351< 0.1%
 
281< 0.1%
 
232< 0.1%
 
211< 0.1%
 
203< 0.1%
 

def_emp_6m
Real number (ℝ≥0)

ZEROS

Distinct14
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.09701752599
Minimum0
Maximum20
Zeros172034
Zeros (%)92.2%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum20
Range20
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3836828182
Coefficient of variation (CV)3.95477842
Kurtosis103.5947152
Mean0.09701752599
Median Absolute Deviation (MAD)0
Skewness6.708168452
Sum18096
Variance0.147212505
MonotocityNot monotonic
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%) 
017203492.2%
 
1119056.4%
 
219561.0%
 
34320.2%
 
41060.1%
 
549< 0.1%
 
616< 0.1%
 
712< 0.1%
 
85< 0.1%
 
122< 0.1%
 
Other values (4)6< 0.1%
 
ValueCountFrequency (%) 
017203492.2%
 
1119056.4%
 
219561.0%
 
34320.2%
 
41060.1%
 
ValueCountFrequency (%) 
201< 0.1%
 
122< 0.1%
 
112< 0.1%
 
101< 0.1%
 
92< 0.1%
 

tem_med_emp
Categorical

HIGH CARDINALITY

Distinct189
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0yrs 0mon
95599 
0yrs 6mon
 
4766
0yrs 7mon
 
4299
0yrs 11mon
 
4214
0yrs 10mon
 
4091
Other values (184)
73554 
ValueCountFrequency (%) 
0yrs 0mon9559951.3%
 
0yrs 6mon47662.6%
 
0yrs 7mon42992.3%
 
0yrs 11mon42142.3%
 
0yrs 10mon40912.2%
 
0yrs 9mon40402.2%
 
1yrs 0mon39752.1%
 
0yrs 8mon38772.1%
 
1yrs 1mon35881.9%
 
0yrs 5mon35291.9%
 
Other values (179)5454529.2%
 
Frequencies of value counts

Unique

Unique25 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length11
Median length9
Mean length9.077019992
Min length9

tem_pri_emp
Categorical

HIGH CARDINALITY

Distinct291
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0yrs 0mon
95405 
0yrs 6mon
 
3795
2yrs 1mon
 
3776
0yrs 7mon
 
3234
2yrs 0mon
 
3051
Other values (286)
77262 
ValueCountFrequency (%) 
0yrs 0mon9540551.1%
 
0yrs 6mon37952.0%
 
2yrs 1mon37762.0%
 
0yrs 7mon32341.7%
 
2yrs 0mon30511.6%
 
1yrs 0mon26721.4%
 
1yrs 1mon24441.3%
 
0yrs 11mon20521.1%
 
0yrs 8mon19371.0%
 
0yrs 9mon19001.0%
 
Other values (281)6625735.5%
 
Frequencies of value counts

Unique

Unique45 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length11
Median length9
Mean length9.090428526
Min length9

qtd_sol_emp
Real number (ℝ≥0)

ZEROS

Distinct23
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2061515202
Minimum0
Maximum36
Zeros161559
Zeros (%)86.6%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum36
Range36
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.7017413349
Coefficient of variation (CV)3.404007568
Kurtosis127.6281962
Mean0.2061515202
Median Absolute Deviation (MAD)0
Skewness7.710467395
Sum38452
Variance0.4924409011
MonotocityNot monotonic
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%) 
016155986.6%
 
1178439.6%
 
243572.3%
 
313890.7%
 
46080.3%
 
52650.1%
 
61930.1%
 
71030.1%
 
887< 0.1%
 
933< 0.1%
 
Other values (13)86< 0.1%
 
ValueCountFrequency (%) 
016155986.6%
 
1178439.6%
 
243572.3%
 
313890.7%
 
46080.3%
 
ValueCountFrequency (%) 
361< 0.1%
 
231< 0.1%
 
221< 0.1%
 
195< 0.1%
 
183< 0.1%
 

default
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0
146161 
1
40362 
ValueCountFrequency (%) 
014616178.4%
 
14036221.6%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

df_indexid_pessoavalor_emprestimocusto_ativoemprestimo_custoagenciarevendedoramontadoraCurrent_pincode_IDnascimentoempregodata_contratoestadofuncionarioflag_telefoneflag_aadharflag_panflag_eleitorflag_cmotoristaflag_passaportescorescore_descpri_qtd_tot_emppri_qtd_tot_emp_atvpri_qtd_tot_defpri_emp_abtpri_emp_sanpri_emp_tomsec_qtd_tot_empsec_qtd_tot_emp_atvsec_qtd_tot_defsec_emp_abtsec_emp_sansec_emp_tompar_pri_emppar_seg_empnov_emp_6mdef_emp_6mtem_med_emptem_pri_empqtd_sol_empdefault
0155653487469634187557185.001361418986378310-05-76Salaried03-09-1882064110000676F-Low Risk4116213651901454300145430000000037468000300yrs 8mon5yrs 5mon00
198628627194424946904265.1812205645492305-02-97Self employed26-10-1831298100100824A-Very Low Risk10000000000000001yrs 1mon1yrs 1mon00
2132937636647569096940784.8651798086339606-05-83Salaried29-10-189679100100755C-Very Low Risk144012162978262666400000000400yrs 5mon1yrs 4mon00
329031518430694888278285.00191437586183807-01-83Self employed19-09-184603110000604H-Medium Risk520686648000080000000000118400201yrs 1mon3yrs 1mon70
467486577759549636678384.90742292886257801-01-94Salaried14-10-184286110000737C-Very Low Risk31036654571315713100000026580000yrs 10mon1yrs 4mon00
545170576606526786497082.501471769445278216-06-92Salaried13-10-1822421100000No Bureau History Available00000000000000000yrs 0mon0yrs 0mon01
6158563543198490787722166.042552341545582901-01-80Self employed27-09-18320791100000No Bureau History Available00000000000000000yrs 0mon0yrs 0mon01
785119459958583427219983.8013522831120169426-03-79Self employed23-08-184175110000300M-Very High Risk3223391691144500114450000000000001yrs 4mon1yrs 11mon00
890337598106629477663884.811032053545700417-09-95Self employed22-10-1871281100100612H-Medium Risk1104269421684216800000022060011yrs 11mon1yrs 11mon00
976243622880685178850079.322482333751177426-07-96Salaried26-10-184422110000738C-Very Low Risk31079169920009200000000000000yrs 11mon1yrs 3mon00

Last rows

df_indexid_pessoavalor_emprestimocusto_ativoemprestimo_custoagenciarevendedoramontadoraCurrent_pincode_IDnascimentoempregodata_contratoestadofuncionarioflag_telefoneflag_aadharflag_panflag_eleitorflag_cmotoristaflag_passaportescorescore_descpri_qtd_tot_emppri_qtd_tot_emp_atvpri_qtd_tot_defpri_emp_abtpri_emp_sanpri_emp_tomsec_qtd_tot_empsec_qtd_tot_emp_atvsec_qtd_tot_defsec_emp_abtsec_emp_sansec_emp_tompar_pri_emppar_seg_empnov_emp_6mdef_emp_6mtem_med_emptem_pri_empqtd_sol_empdefault
186513170573510770567596714386.3815157338630001-01-78Self employed16-09-1811227511000016Not Scored: No Activity seen on the customer (Inactive)10000032039567964050000404861701100001yrs 1mon2yrs 2mon00
18651496696626144589478160074.751042153149714811-01-94Self employed26-10-18109001100000No Bureau History Available00000000000000000yrs 0mon0yrs 0mon10
18651536985429239515786800077.942481479145178921-07-94Salaried09-08-18416611000017Not Scored: Not Enough Info available on the customer11111100000000008yrs 5mon8yrs 5mon00
186516176963525238576597126083.22181411586269401-01-80Self employed21-09-1842605110000309L-Very High Risk63234668835750035750000000000021yrs 2mon1yrs 7mon00
186517139443451783518036600280.3051414586334510-09-80Salaried21-08-189841100100483K-High Risk3613833043060199060199000000089810202yrs 0mon10yrs 1mon00
186518158885578514510787100873.23292437749593001-12-97NaN15-10-18320821100000No Bureau History Available00000000000000000yrs 0mon0yrs 0mon00
186519144610557346386946604261.0222256145238112-07-93Salaried04-10-1845361100000No Bureau History Available00000000000000000yrs 0mon0yrs 0mon00
186520204677647122519136681078.75841743386293301-01-61Self employed31-10-18212767110000300M-Very High Risk95214182171462307146230700000074280002yrs 7mon5yrs 3mon11
18652168304459266600077146085.45761659686442626-05-79Self employed23-08-188693110000783B-Very Low Risk53015775652057000205700000000012600101yrs 7mon2yrs 10mon00
18652257003467189530786252687.96201400486623901-01-82Self employed27-08-18519031100000No Bureau History Available00000000000000000yrs 0mon0yrs 0mon01